Direct Information Reweighted by Contact Templates: Improved RNA Contact Prediction by Combining Structural Features
Authors
Abstract
Co-evolving nucleotide-nucleotide interactions are widely recognized as essential for RNA structure and function. Direct coupling analysis (DCA) infers nucleotide contacts in a sequence from an alignment of its homologous sequences across different species. DCA and similar approaches that use sequence information alone usually yield low accuracy, especially when few homologous sequences are available. Here we present a new method that uses a Restricted Boltzmann Machine (RBM) to augment sequence co-variation information with structural patterns during contact inference. We name the method DIRECT, which stands for Direct Information REweighted by Contact Templates. Benchmark tests demonstrate that DIRECT improves contact prediction accuracy by 13% on average compared with traditional DCA. These results suggest that DIRECT could be used to improve predictions of RNA tertiary structure and function. The source code and dataset of DIRECT are available at http://zhao.phy.ccnu.edu.cn:8122/DIRECT/index.html.
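To make the idea of sequence co-variation concrete, here is a minimal sketch that ranks pairs of alignment columns by mutual information. This is a simpler stand-in chosen for illustration: it is neither DCA nor the DIRECT method, and the function names and the toy alignment are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def mutual_information(msa, i, j):
    """Mutual information (in nats) between columns i and j of an alignment."""
    n = len(msa)
    freq_i = Counter(seq[i] for seq in msa)
    freq_j = Counter(seq[j] for seq in msa)
    freq_ij = Counter((seq[i], seq[j]) for seq in msa)
    mi = 0.0
    for (a, b), count in freq_ij.items():
        p_ab = count / n
        p_a = freq_i[a] / n
        p_b = freq_j[b] / n
        mi += p_ab * math.log(p_ab / (p_a * p_b))
    return mi


def rank_pairs(msa, min_separation=1):
    """Return column pairs sorted from strongest to weakest co-variation."""
    length = len(msa[0])
    scores = {
        (i, j): mutual_information(msa, i, j)
        for i, j in combinations(range(length), 2)
        if j - i >= min_separation
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical toy alignment: columns 0 and 3 vary together as
# complementary bases, while columns 1 and 2 are fully conserved.
toy_msa = ["GAAC", "CAAG", "AAAU", "UAAA"]
top_pair = rank_pairs(toy_msa)[0]
```

In this toy alignment the pair of columns 0 and 3 scores highest, because the two positions change together while the conserved columns carry no co-variation signal. Mutual information cannot separate direct from indirect correlations, which is the limitation that DCA-style methods are designed to address.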
Similar resources
Predicting membrane protein contacts from non-membrane proteins by deep transfer learning
Computational prediction of membrane protein (MP) structures is very challenging, partly due to a lack of sufficient solved structures for homology modeling or for parameter estimation of computational methods. Recently, direct evolutionary coupling analysis (DCA) has shed some light on protein contact prediction and, accordingly, contact-assisted folding, but DCA is effective only on some very large-si...
Hidden conformations in protein structures
Motivation: Prediction of interactions between protein residues (contact map prediction) can facilitate various aspects of 3D structure modeling. However, the accuracy of ab initio contact prediction is still limited. As structural genomics initiatives move ahead, solved structures of homologous proteins can be used as multiple templates to improve contact prediction of the major conformation of...
The Prediction of the Tensile Strength of Sandstones from their petrographical properties using regression analysis and artificial neural network
This study investigates the correlations among the tensile strength, mineral composition, and textural features of twenty-nine sandstones from Kouzestan province. Regression analyses and an artificial neural network (ANN) are also applied to evaluate the correlations. The results of simple regression analyses show no correlation between mineralogical features and tensile strength. However,...
Beyond the Twilight Zone: automated prediction of structural properties of proteins by recursive neural networks and remote homology information.
The prediction of 1D structural properties of proteins is an important step toward the prediction of protein structure and function, not only in the ab initio case but also when homology information to known structures is available. Despite this, the vast majority of 1D predictors do not incorporate homology information into the prediction process. We develop a novel structural alignment method,...
A comprehensive assessment of sequence-based and template-based methods for protein contact prediction
Motivation: Pair-wise residue-residue contacts in proteins can be predicted from both threading templates and sequence-based machine learning. However, most structure modeling approaches only use the template-based contact predictions in guiding the simulations; this is partly because the sequence-based contact predictions are usually considered to be less accurate than those from threading. With t...